Evaluating Salience Metrics for the Context-Adequate Realization of Discourse Referents

Author

  • Christian Chiarcos
Abstract

We describe the application of a framework for salience metrics and linguistic variability with respect to the contextually adequate choice of referring expressions and grammatical roles: Where multiple meaning-equivalent candidate realizations are available that differ in one of these aspects, NLG systems can apply salience metrics to predict contextually adequate realization preferences. We evaluate this claim and a number of parameters of salience metrics found in the theoretical literature on two German newspaper corpora. Key features of the approach described here include the application of a two-dimensional model of salience, the use of its theoretical predictions to develop salience metrics for a particular phenomenon, and the subsequent application of these metrics to other phenomena. This approach can be applied to develop classifiers to predict packaging preferences for phenomena where little training data is available.

1 Motivation and Background

Example (1), a sentence from the RST Discourse Treebank (Carlson et al., 2003, file 3), illustrates how the same ‘thought’ can be realized, or ‘packaged’ (Chafe, 1976), in many different ways. Three referents, the insurance agent Toni, her sister Cynthia, and their apartment, suffer from an earthquake; the central protagonist of the paragraph is Toni, and the text goes on to elaborate her situation.

(1) The apartment she shares with her sister was rattled ...
(a) The apartment the agent shares with her sister ...
(b) The earthquake rattled the apartment she shares ...

We consider two packaging phenomena: referring expressions (1a: definite NP vs. pronoun) and grammatical roles (1b: active vs. passive). [1] These variants are meaning-equivalent in the sense of Dorr et al. (2004), but according to theories of referential coherence (Sgall et al., 1986; Grosz et al., 1995; Givón, 2001), they express different discourse functions, often described with reference to the notion of ‘discourse salience’. [2] Accordingly, the local discourse context – or, better, a salience score calculated on this basis – can help to predict contextually adequate packaging preferences.

In NLG, discourse salience has been employed to generate referring expressions (McCoy and Strube, 1999), to assign grammatical roles (Stede, 1998), and to determine word order preferences (Kruijff et al., 2001). More recently, however, salience-based approaches have been increasingly superseded by statistical approaches that nevertheless build on earlier theories of salience, e.g., Shiramatsu et al. (2007) for referring expressions, Zarrieß et al. (2011) for voice alternation, and Cahill and Riester (2009) for word order. One of the reasons for this methodological shift may be the observation (noted, for example, by ...

[1] Along with referring expressions and grammatical roles, word order alternation has been described in a similar way, and it is of particular importance for the motivation of two-dimensional models of salience (Chiarcos, 2011b). For reasons of space, however, this paper concentrates on referring expressions and grammatical roles.

[2] Discourse salience is to be distinguished from other types of salience that are either not specific to discourse referents (e.g., salience of semantic features, Ortony et al. 1985) or defined with respect to other modalities (e.g., visual salience, Itti 2003, Kelleher 2011).
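
As a rough illustration of how a salience score calculated from the local discourse context could drive the choice between a pronoun and a definite NP, as in (1a), consider the following minimal sketch. It is not the metric evaluated in the paper: the recency decay, the subject bonus, the threshold, and all names (`Mention`, `salience`, `choose_expression`) are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Mention:
    referent: str      # hypothetical identifier of the discourse referent
    sentence: int      # index of the sentence containing the mention
    is_subject: bool   # whether the mention is a grammatical subject


def salience(referent, history, current_sentence, decay=0.5, subject_bonus=1.0):
    """Toy salience score: recent mentions and subject mentions count more.

    The decay and bonus values are illustrative assumptions, not taken
    from the paper.
    """
    score = 0.0
    for m in history:
        # Only earlier mentions of the same referent contribute.
        if m.referent != referent or m.sentence >= current_sentence:
            continue
        distance = current_sentence - m.sentence
        weight = 1.0 + (subject_bonus if m.is_subject else 0.0)
        score += weight * decay ** (distance - 1)
    return score


def choose_expression(referent, history, current_sentence, threshold=1.5):
    """Prefer a pronoun for highly salient referents, a definite NP otherwise."""
    s = salience(referent, history, current_sentence)
    return "pronoun" if s >= threshold else "definite NP"


# Toy discourse loosely modeled on example (1): Toni is the protagonist.
history = [
    Mention("toni", 0, True),
    Mention("toni", 1, True),
    Mention("cynthia", 0, False),
]
print(choose_expression("toni", history, 2))
print(choose_expression("cynthia", history, 2))
```

In this toy setup the repeatedly mentioned subject referent passes the threshold and is pronominalized, while the referent mentioned once in a non-subject role is realized as a definite NP. A two-dimensional model would need a second score alongside this one, which the sketch does not attempt.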

Publication date: 2011